Picture for Chen Zhang

Chen Zhang

SenseTime Research

Quantifying the Energy Floor: Direct Measurement and Replay Buffer Bias in SAC-Based HVAC Control on sbsim

Add code
Jun 01, 2026
Viaarxiv icon

ATLAS: All-round Testing of Long-context Abilities across Scales

Add code
May 27, 2026
Viaarxiv icon

Learning to Adapt SFT Data for Better Reasoning Generalization

Add code
May 26, 2026
Viaarxiv icon

OmniISR: A Unified Framework for Centralized and Federated Learning via Intermediate Supervision and Regularization

Add code
May 19, 2026
Viaarxiv icon

CineMatte: Background Matting for Virtual Production and Beyond

Add code
May 18, 2026
Viaarxiv icon

EvoMemBench: Benchmarking Agent Memory from a Self-Evolving Perspective

Add code
May 18, 2026
Viaarxiv icon

DRL-STAF: A Deep Reinforcement Learning Framework for State-Aware Forecasting of Complex Multivariate Hidden Markov Processes

Add code
May 14, 2026
Viaarxiv icon

PruneTIR: Inference-Time Tool Call Pruning for Effective yet Efficient Tool-Integrated Reasoning

Add code
May 11, 2026
Viaarxiv icon

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

Add code
Apr 24, 2026
Viaarxiv icon

TriEx: A Game-based Tri-View Framework for Explaining Internal Reasoning in Multi-Agent LLMs

Add code
Apr 21, 2026
Viaarxiv icon